AIAA  Guidance,  Navigation,  and  Control  Conference  and  Exhibit 
16-19  August  2004,  Providence,  Rhode  Island 


AIAA  2004-5252 


AN  ADAPTIVE  VISION-BASED  APPROACH  TO 
DECENTRALIZED  FORMATION  CONTROL 


Ramachandra  Sattigeri 1  and  Anthony  J.  Calise2 
School  of  Aerospace  Engineering,  Georgia  Institute  of  Technology,  Atlanta,  GA  30332-0150 

Johnny  H.  Evers 3 

Munitions  Directorate,  Air  Force  Research  Laboratory’,  Eglin  AFB,  FL  32542-6810 


In  considering  the  problem  of  formation  control  in  the  deployment  of  intelligent 
munitions,  it  would  be  highly  desirable,  both  from  a  mission  and  a  cost  perspective,  to  limit 
the  information  that  is  transmitted  between  vehicles  in  formation.  In  a  previous  paper,  we 
proposed  an  adaptive  output  feedback  approach  to  address  this  problem.  Adaptive 
formation  controllers  were  designed  that  allow  each  vehicle  in  formation  to  maintain 
separation  and  relative  orientation  with  respect  to  neighboring  vehicles,  while  avoiding 
obstacles.  In  this  paper,  we  consider  a  modification  to  the  adaptive  control  law  that  enables 
each  vehicle  in  a  leader-follower  formation  to  track  line-of-sight  (LOS)  range  with  respect  to 
two  or  more  neighboring  vehicles  with  zero  steady-state  error.  We  also  propose  a 
coordination  scheme  in  which  each  vehicle  tracks  LOS  range  to  up  to  two  nearest  vehicles 
while  simultaneously  navigating  towards  a  common  set  of  waypoints.  This  coordination 
scheme  does  not  require  a  unique  leader  for  the  formation,  increasing  robustness  of  the 
formation.  As  our  results  show,  such  leaderless  formations  can  perform  maneuvers  like 
splitting  to  go  around  obstacles,  rejoining  after  negotiating  the  obstacles,  and  changing  into 
line-shaped  formation  in  order  to  move  through  narrow  corridors. 


I.  Introduction 

As  demonstrated  in  recent  conflicts,  unmanned  aerial  vehicles  (UAVs)  are  becoming  an  important  component  of 
our  military  force  structure.  UAVs,  operating  in  close  proximity  to  enemy  forces,  provide  real-time  information 
difficult  to  obtain  from  other  sources,  without  risk  to  human  pilots.  Among  the  weapons  employed  by  these  UAVs 
will  be  flocks  of  cooperative  miniature  or  micro  autonomous  vehicles  (MAVs)  operating  in  close  proximity  to 
terrain  or  structures  that  will  gather  information  on  enemy  movements  and,  under  human  supervision,  seek  out, 
identify,  and  attack  targets  of  opportunity.  In  large  groups  of  MAVs  or  small  UAVs,  even  small  percentage 
reductions  in  drag  will  offer  significant  increased  payoffs  in  the  ability  to  maintain  persistent  coverage  of  a  large 
area.  One  concept,  well  known  to  bicyclists,  race  car  drivers,  and  pilots  and  exploited  by  swimming  and  flying 
animals,  is  the  benefit  of  operating  in  the  wake  of  another  vehicle  (or  organism).  Therefore  maintaining  a  formation 
while  at  the  same  time  executing  searches  in  a  congested  environment  will  be  a  primary  requirement.  Stealth  like 
operations  will  also  be  important,  implying  the  need  to  maintain  autonomy  and  to  minimize  communication. 
Maintaining  a  formation  is  also  important  from  this  perspective  so  that  passive  (vision  based)  sensing  can  be  used  to 
ascertain  the  locations  and  behaviors  of  cooperating  MAVs/UAVs. 

Standard  approaches  for  formation  control  include  the  leader-follower,  behavior-based  and  the  virtual  structure 
approaches.  In  leader-follower  based  approach,1 2  one  vehicle  is  designated  as  a  leader  and  the  remaining  vehicles  as 
followers.  The  followers  track  the  range  from  the  leader  and  other  followers  to  desired  values.  The  leader  sets  a 
nominal  trajectory  for  the  formation  to  follow  and  may  cooperate  with  the  followers  in  regulating  range.  In  the 
virtual  structure  approach,  the  entire  formation  is  treated  as  a  single  entity.  '  4  Desired  motion  is  assigned  to  this 
single  entity,  the  virtual  structure,  which  traces  out  trajectories  for  each  member  in  the  formation  to  track.  In 
behavior-based  approaches,5’6  several  desired  behaviors  are  prescribed  for  each  vehicle  and  the  final  control  is 
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derived  from  a  weighting  of  the  relative  importance  of  each  behavior.  Since  in  the  leader-follower  and  virtual 
structure  based  approaches,  coordination  is  with  respect  to  a  central  agent,  the  formation  controls  lack  robustness. 
Behavior-based  approaches  are  decentralized  and  are  significantly  easier  to  implement.  However,  these  are  difficult 
to  analyze  mathematically  and  formation  convergence  to  desired  configurations  is  not  guaranteed. 

Although  imperfectly  understood,  flocking  behavior  of  birds,  schooling  behavior  of  fish,  and  even  studies  of 
swarming  insects  have  provided  inspiration  for  concepts  of  coordinated  multi-vehicle  operation.7  Reynolds8 
introduced  a  model  that  suggests  flocking  is  the  combined  result  of  three  simple  steering  rules  that  each  agent 
follows  independently.  In  this  model,  each  agent  can  access  the  whole  scene’s  geometric  description,  but  flocking 
requires  that  it  react  only  to  flock-mates  within  a  certain  small  neighborhood.  Reynolds  rules  were  validated  in  a 
graph-theoretic  and  Lyapunov  stability  analysis  framework.9 10  Convergence  properties  on  individual  agent  velocity 
vectors  and  relative  distances  were  shown.  Ref.  10  also  provided  a  framework  for  addressing  splitting,  rejoining  and 
squeezing  maneuvers  for  flocks  in  the  presence  of  multiple  obstacles. 

In  our  approach,  we  assume  that  the  vehicles  do  not  communicate  velocity  vector  information.  The  lack  of 
relative  velocity  vector  information  is  treated  as  modeling  uncertainty,  whose  effect  on  line-of-sight  (LOS)  range 
(output)  regulation  is  to  be  canceled  by  the  output  of  an  online  adaptive  neural  network  (NN).2  As  a  result,  each 
vehicle  can  regulate  both  the  range  and  relative  orientation  to  a  leader  and/or  neighboring  vehicle  without  knowing 
the  state  and  control  policy  of  that  vehicle.  It  is  assumed  that  each  vehicle  can  measure  its  own  speed,  heading,  range 
and  angle  to  other  vehicles.  The  theory  is  based  on  an  error  observer  approach  to  adaptive  output  feedback  control  of 
uncertain,  MIMO  systems. 1 1  The  approach  is  adaptive  to  both  parametric  uncertainty  and  unmodeled  dynamics.  The 
method  of  Pseudo-Control  Hedging  (PCH)  12,13  is  used  to  protect  the  adaptive  process  from  actuator  limits  and 
actuator  dynamics.  It  is  also  used  to  protect  the  adaptive  process  during  periods  when  it  is  not  in  control  of  the 
vehicle. 

In  this  paper,  we  modify  the  construction  of  the  PCH  signals  so  that  each  vehicle  in  a  leader-follower  formation 
tracks  LOS  ranges  with  respect  to  two  or  more  neighboring  vehicles  with  zero  steady-state  error.  Because  of  the 
robustness  issues  associated  with  a  leader-follower  formation,  we  propose  a  coordination  scheme  that  does  not 
depend  on  a  unique  leader.  In  this  scheme,  which  we  call  a  leaderless  formation  scheme,  each  vehicle  tracks  LOS 
range  to  up  to  two  nearest  vehicles  while  simultaneously  navigating  towards  a  common  set  of  waypoints.  For  a 
vehicle  to  be  tracked,  it  must  lie  within  a  specified  range  from  another  vehicle.  The  leaderless  nature  of  this  scheme 
renders  the  formation  robust  to  failures  in  one  or  more  vehicles.  Changes  in  the  formation  shape  required  while 
negotiating  different  obstacles  are  easier  to  implement  using  this  approach.  Since  the  number  of  vehicles  that  can  be 
tracked  using  this  approach  is  0,  1,  or  2,  the  control  laws  may  be  switching.  Switching  of  the  control  laws  can  lead  to 
the  adaptive  controller  associated  with  tracking  a  particular  neighboring  vehicle  to  not  be  in  control  of  the  plant.  In 
such  cases,  PCH  is  vital  to  the  stability  of  the  adaptation  process.18 

The  organization  of  the  paper  is  as  follows.  The  next  section  summarizes  the  theory  for  the  error  observer 
approach  and  states  the  problem  formulation  for  decentralized  formation  control.  Next,  we  review  the  inverting 
control  design  for  formation  control.  We  also  briefly  discuss  the  static  obstacle  avoidance  controller.  Simulation 
results  for  a  leader-follower  team  of  4  members  regulating  LOS  range  from  each  other  are  shown.  Following  this, 
the  coordination  scheme  for  leaderless  formation  flying  is  described.  Simulation  results  with  this  scheme  are  shown 
for  a  team  of  5  members.  The  results  show  splitting,  rejoining  and  squeezing  maneuvers  in  the  presence  of  obstacles. 

II.  ADAPTIVE  OUTPUT  FEEDBACK  APPROACH 

Consider  the  observable  nonlinear  system  described  by 


x  =  f(x,u) 

y  =  g{x) 

where  rcfic'J?"  are  the  states  of  the  system,  u,y  e  '.It'”  are  the  controls  and  regulated  output  variables 
respectively,  and  /(•,•),  g(-)  are  uncertain  functions.  Moreover  n  need  not  be  known. 
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1.  Assumption 

The  system  in  Eq.  (1)  satisfies  the  condition  for  output  feedback  linearizability  with  vector  relative  degree 
[\,r2,...,rmY ,  r  =  rx  +r2  +...rm  <n  ,14 

Then  there  exists  a  mapping  that  transforms  the  system  into  the  so-called  normal  form: 


z  =  f\Z,z) 

H=g 

;  (2) 

i?  =hi(g,z,u) 

=  yt ,  i  =  1,2, . . . ,  m, 

where  /?,  (£,  %,  u )  =  h,  (x,  u ) ,  where  £  =  [%[  . . .  %Tm  f ,  =  [c,1  . . .  J  and  j  are  the  states  associated  with  the 

internal  dynamics.  Note  that  is  simply  the  j"'  time  derivative  of  yt . 

2.  Assumption 

The  zero  dynamics  are  asymptotically  stable. 

The  objective  is  to  design  an  output  feedback  control  law  that  causes  yt{t)  to  track  a  smooth  bounded  reference 
trajectory  yci(t)  with  bounded  tracking  error. 

A.  Controller  Design  and  Tracking  Error  Dynamics 

Feedback  linearization  is  achieved  by  introducing  the  following  inverse 


u  =  h  *( y,v ) 


(3) 


where 


v  =  h(y,u) 


(4) 


is  the  pseudo-control  signal.  The  pseudo-control  signal  h(y,u)=\hl(y,u),...,htn(y,u)\  represents  an  invertible 
approximation  to  h(x,u)=[hl(x,u),...,hm(x,u)]T  in  Eq.  (2),  which  is  limited  to  using  only  the  available 
measurements  and  control  signal.  If  outputs  other  than  the  regulated  output  are  available  for  feedback,  they  may  also 
be  used  in  Eq.  (3)  to  form  the  approximate  inverse. 

Thus  the  system  dynamics,  as  far  as  the  regulated  output  variable  is  concerned,  is  given  by, 


yr  =v+A 


(5) 


where 


3 

American  Institute  of  Aeronautics  and  Astronautics 


A(£,  Z,  y)  =  h{g,  z,  h  1  O',  V'))-  /?(y,  /;  1  O',  v)) 


(6) 


is  the  inversion  error  that  results  from  the  use  of  Eq.  (3)  in  place  of  an  exact  state  feedback  inverse.  The  pseudo¬ 
control  is  chosen  to  have  the  form 


v  =  v  +  v ,  —  v 

sc  dc  a, 


(7) 


where  y[.  are  generated  by  stable  reference  models  that  define  the  desired  closed-loop  behavior,  vdc  is  the 
output  of  a  dynamic  compensator  designed  to  stabilize  the  linearized  error  dynamics,  and  vad  is  the  adaptive 
component. 

From  Eq.  (5)  and  Eq.  (7),  the  error  dynamics  are  given  as, 


yr  =yrc-yr  =  -ydc  +  vad  -  A0, y)  (8) 

From  Eq.  (6)  and  Eq.  (7)  it  is  seen  that  A  depends  on  vad  through  v,  and  Eq.  (8)  shows  that  vad  has  to  be 
designed  to  cancel  A  .  Therefore  the  following  assumption  is  introduced  to  guarantee  existence  and  uniqueness  of  a 
solution  for  vad . 

3.  Assumption 

The  map  vad  I— >  A  is  a  contraction  over  the  entire  input  domain  of  interest.  It  can  be  shown  that  this  assumption 
leads  to  the  following  conditions15 


i)  sgn 


(  dh  ' 


\dltu 


=  sgn 


/  dh,  ' 


y8u‘; 


,  i  =  1,2,..., m 


ii) 


0^ 

du. 


1 

>  — 
2 


dh, 


du, 


>  0 ,  i  =  1,2, ...,m 


The  first  condition  requires  that  the  sign  of  the  control  effectiveness  is  modeled  correctly  and  the  second  places  a 
lower  bound  on  the  estimate  of  the  control  effectiveness. 

B.  Error  Observer 

It  can  be  shown  that  the  error  dynamics  in  Eq.  (8)  can  be  written  as 


E  =  AE  +  B\yad  -  a] 


(9) 


where  the  elements  of  E  are  made  up  of  yi  and  its  derivatives  up  to  order  (r;  - 1)  and  the  dynamic  compensator 

states.  An  error  observer  is  designed  based  on  this  equation,11  which  results  in  error  estimates  E  that  are  used  in  the 
adaptive  update  law  given  below. 
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C.  Approximation  of  the  Inversion  Error 

The  inversion  error  A  can  be  approximated  to  any  desired  degree  of  accuracy  by  using  a  Single  Hidden  Layer 
Neural  Network  (SHL  NN)  with  sufficient  number  of  hidden  layer  neurons,  and  having  the  following  input  vector, 

16,17 


(10) 


where 


yj  (0  =  NO*  v(e  v(t  -  («,  -  N  )f , 

yl  (0  =  NO’  y{t-d\...,y{t-(nl- \)d)]T 


with  /?,  >  n.  Since  n  is  unknown,  a  sufficient  number  of  delayed  signals  are  required.  The  input-output  map  of  a 
SHL  NN  is  given  by 


Vad  =  WTa{vTx) 


(11) 


where  a  is  the  so-called  squashing  function.  The  NN  is  trained  online  with  the  adaptive  law 


W  =  -Yw\l(a[yTx)-a'[yTxyTx)ETPB  +kw\ 
V  =  -Tv\lxET PBWT a  {yTx)+kv\ 


where  a  (z)  =  diag: 


f  dcr  > 


.  dz  , 

V  i  j 


,  P  is  the  positive  definite  solution  to  the  Lyapunov  equation  A1  P  +  PA  +  Q  =  0 ,  with 


Q> 0,  and  l  lr  and  r,  are  the  adaptation  gains.  It  has  been  shown  that  the  adaptive  law  in  Eq.  (12)  guarantees 
(subject  to  upper  and  lower  bounds  on  the  adaptation  gains)  that  all  error  signals  and  the  NN  weights  are  uniformly 
ultimately  bounded. 1 1 


D.  Pseudo-Control  Hedging  (PCH) 

PCH  is  introduced  to  protect  the  adaptive  law  from  effects  due  to  actuator  limits  (such  as  rate  and  position 
limits),  unmodeled  actuator  dynamics  and  when  the  adaptive  process  is  not  in  control  of  the  plant.1213  The  main  idea 
behind  PCH  methodology  is  to  modify  the  reference  command,  yc ,  in  order  to  prevent  the  adaptive  element  from 
adapting  to  these  actuator  characteristics.  This  is  commonly  done  by  generating  the  command  using  a  reference 
model  for  the  desired  response.  The  reference  model  is  ‘hedged’  by  an  amount  equal  to  the  difference  between  the 
commanded  and  an  estimate  for  the  achieved  pseudo-control.  To  compute  this  difference,  a  measurement  or  estimate 
of  the  actuator  position  u  is  required.  The  pseudo-control  hedge  signal  is  given  by, 


vht  =KV’ucn,d)-hi(y’di\  i  =  1,2,...,' 


(13) 
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where  ycmd.  is  the  external  command  signal,  then  the  reference  model  update  with  PCH  is  set  to 


y 


n 

ci 


m,  (, 


(14) 


The  instantaneous  output  of  the  reference  model  used  to  construct  the  pseudo-control  signal  remains  unchanged 
and  is  given  by 


(15) 


The  block  diagram  of  the  MRAC  controller  architecture  with  PCH  and  error  observer  is  given  below. 


v 


Figure  1.  MRAC  Architecture  with  PCH 


III.  Formation  Control  Formulation 

Consider  a  group  of  N  vehicles  whose  individual  dynamics  are  given  by, 


x,  =  fi(x,,ui),  i  =  1,2, ...,N 


(16) 
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where  xi  represents  the  states  and  ut  the  control  vector  of  the  i'h  vehicle.  Assume  that  vehicles  i  and  j 
cooperate  by  regulating  a  joint  variable  (e.g.,  LOS  range) 


z  =  g[xi,xj) 


(17) 


whose  relative  degree  (r)  is  known,  so  that, 


t(r)  =gr(xi,xjui,uJ) 


(18) 


To  arrive  at  a  decentralized  control  solution,  the  following  approximation  is  employed  by  the  i  vehicle 


=  gri(z,xi,ui)=vi 


(19) 


Equation  (19)  forms  the  basis  for  an  inverting  control  design  in  which  the  inversion  error  is 

A,  =  gr  k ,  Xj ’  uj  )  “  Sri  (z,  xi  ,  w, ) 


(20) 


Vehicle  i  s  inverting  solution  is  augmented  with  a  NN  that  estimates  and  approximately  cancels  A,. .  The 

input  vector  to  the  NN  for  the  i‘h  vehicle  is  given  by  /r,-  =  \xhuid(t),z ,  where  uj(/  (t),zd(t)  are  vectors  of 
sufficiently  large  number  of  delayed  values  of  U/(t),z(t)  respectively.16,17  So,  the  decentralized  control  solution  of 

all  cooperating  aircraft  is  given  by  ut  =  (vq- ,  z,  xt ) ,  where  vi  is  constructed  as  in  Eq.  (7). 

IV,  APPLICATION  TO  FORMATION  CONTROL 

The  formation  of  vehicles  is  constrained  to  lie  in  a  two-dimensional  plane.  The  vehicles  are  considered  to  be 

point-mass  objects  that  can  accelerate  both  along  and  perpendicular  to  the  direction  of  motion.  The  non- 

•  •  •  •  •  •  th  *  *2 

dimensionalized  equations  of  motion  for  the  l  aircraft  are  given  by 


Xj  =  Vj  cos  y/j 

(21) 

yt  =  vi sin  Vi 

(22) 

ir|  ^ 

II 

(23) 

~kn 

(24) 
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where  (xi,yi )  are  the  inertial  position  coordinates,  i//; ,  F;-  are  the  heading  and  speed  variables,  kn,ki2  are 
constants  representing  the  effect  of  drag  forces  and  ,ai2  are  the  controls  representing  non-dimensionalized 
acceleration.  Bounds  are  placed  on  the  controls  to  prevent  slowing  below  the  stall  speed,  and  to  prevent  exceeding 
maximum  bank  angle  limits  and  maximum  and  minimum  longitudinal  acceleration  limits.2  We  model  the  actuator 
system  as  a  saturation  element  with  limits  described  above 


u  =  sat  (ucmd ) 


(25) 


Figure  2  shows  the  variables  involved  in  describing  the  LOS  kinematics.  The  LOS  kinematics  of  the  ith  aircraft 


with  respect  to  the  jth  aircraft  is  given  by 


Figure  2.  LOS  Kinematics 


Vj  cos(^.  -  Ay)- V_  cos(V/i  - ) 
Vj  sin ^f/j  -  Xy )-  Vt  sin(^,.  -  ) 


(26) 

(27) 


The  information  available  to  aircraft  i  include:  Vt ,  <//,  (by  use  of  an  inertial  measuring  unit  IMU),  Kf/ ,  2(/ 
(through  vision-based  sensors)  and  the  control  signals  an ,  ai2 . 


A.  Adaptive  Formation  Control  Design 

We  design  an  inverting  controller  augmented  with  a  NN  for  aircraft  i  for  regulating  the  LOS  range  Rjy  with 
respect  to  aircraft  j  .  The  controller  architecture  is  as  shown  in  Fig  1.  The  relative  degree  of  Ry  with  respect  to  the 
speed  and  heading  of  aircraft  i  is  1 .  Flence  the  range  command  RiJcom  ,  for  the  separation  between  the  aircraft  i  and 
/  ,  is  filtered  through  a  first  order  reference  model.  Figure  3  shows  the  hedged  reference  model.  A  rate  limit  is 
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introduced  so  that  the  reference  model  does  not  command  large  range  rates  when  the  range  error  is  large.  The 
parameter  p  is  the  time  constant  and  is  a  design  parameter. 

The  dynamic  compensator  portion  of  the  pseudo-control  is  a  proportional  error  controller, 
vdC(i  j)  ~  kpj  (^c(,  j  j  -  Rjj ) .  Referring  to  Eq.  (7)  the  pseudo-control  signal  is 


R c(‘.j ) 
RcQj) 


Figure  3.  Hedged  Reference  Model 


=  R 


>+kPi(Rc(i,j)-Rv)-v 


ad  {i,j 


(28) 


The  pseudo-control  signal  V(,  (1  is  the  commanded  LOS  range-rate  for  aircraft  i  with  respect  to  aircraft  j  .  The 
pseudo-control  signal  is  inverted  to  give  a  commanded  velocity  vector  VFC  .  In  case  aircraft  i  is  regulating  LOS 

range  with  respect  to  multiple  neighboring  aircraft,  say  m  >  1  in  number,  then  the  commanded  velocity  vector  for 
aircraft  i  is  given  by  the  vector  sum  of  the  pseudo-control  signals  oriented  along  Ay 2 


Vfc,  -  -  X 

j,i*i 


V(U)  ■ 


(29) 


V)(‘ 


cos  Ay  I  +  sin  Ay 


j) 


(30) 


where  Ay  is  the  unit  vector  along  the  LOS  from  aircraft  i  to  aircraft  j  ,  and  /,  J  are  unit  vectors  aligned  along 
the  X  and  Y  inertial  axes  respectively. 

B.  Hedge  Signals 

Eq.  (29)-(30)  show  that  when  commanding  range  with  respect  to  m  >  1  aircraft,  we  are  actually  trying  to  track  m 
pseudo-control  signals  with  just  one  control  variable,  the  velocity  vector.  This  means  each  aircraft  is  an 
underactuated  system  when  it  commands  range  with  respect  to  multiple  aircraft.  In  this  case,  the  method  of 
calculating  the  hedge  signal  is  special.  We  do  a  non-orthogonal  projection  of  the  actual  velocity  vector  along  each  of 

the  unit  vector  directions  Ay  ,  /  =  !,...  m,  j  ^  / .  Each  of  these  projections  is  treated  as  the  achieved  pseudo-control 

along  the  particular  direction  Ay  .  The  difference  between  the  commanded  pseudo-control  and  the  achieved  pseudo¬ 
control  signal  is  the  hedge  signal.  The  actual  mathematics  for  doing  the  above  calculation  is  shown  below. 
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(31) 


l7/'  a(i,j)^ij  N  ij  [<5,  ] 

jj*i 


where  the  kth  column  of  NtJ  is  and  [or,  ]  is  the  vector  of  elements  ah  ,  for  all  j,  j  *  i .  Note  that  «(/j) 
is  the  estimate  of  the  achieved  pseudo-control  for  aircraft  i  along  the  direction  /L- . 

Thus  we  can  solve  Eq.  (3 1 )  to  obtain 


a\  =  -N-~lV- 

\  -I  »  y  r  I 


(32) 


The  corresponding  expression  for  the  PCH  signal  then  becomes 


v\i,j)=v(u)~aUJ) 


(33) 


The  hedge  signals  go  to  zero  only  when  the  LOS  range  equals  the  commanded  range.  This  is  an  improvement 
over  the  preceding  version  of  eq.  (33)  that  resulted  in  steady-state  errors  whenever  two  or  more  aircraft  were 
tracked. 

The  corresponding  expression  for  inversion  error  A(, ,  is  given  as 


A(ij)=kij-a(ij) 


(34) 


C.  Static  Obstacle  Avoidance 

To  illustrate  the  concept,  it  is  assumed  that  the  obstacles  are  contained  within  bounding  spheres  (circles  in  2 
dimensions),  and  that  the  centers  (X0,Y0)  and  radii  (r)  of  the  obstacles  are  known.  The  goal  of  this  strategy  is  to 
keep  an  imaginary  line  Lo  of  length  D0 ,  originating  at  the  vehicle’s  current  position  and  extending  in  the  direction 
of  the  velocity  vector,  from  intersecting  with  any  obstacle  boundary.  The  length  of  this  line  is  typically  based  upon 
the  vehicle’s  speed  and  maneuverability.  An  obstacle  further  away  than  this  length  D0  is  not  an  immediate  threat. 
The  obstacle  avoidance  behavior  considers  each  obstacle  in  turn  and  determines  if  they  intersect  with  La .  The 
obstacle  which  intersects  La  nearest  the  aircraft  is  selected  as  the  “most  threatening”  and  corrective  steering  action 
is  undertaken  to  avoid  this  obstacle.  If  no  obstacle  collision  is  imminent,  no  steering  action  is  taken.  Corrective 
steering  action  to  avoid  an  obstacle  involves  a  speed  and  heading  change  command.  The  heading  change  command 
ky/OAj  is  towards  the  closest  projected  edge  of  the  obstacle  in  the  local  velocity  fixed  frame  as  shown  in  Fig  4.  The 

output  of  the  static  obstacle  avoidance  controller  is  the  commanded  velocity  vector  VqA  .  Please  refer  to  Ref.  [2]  for 
details  of  constructing  V0A  . 


Craig  Reynolds,  “Not  Bumping  Into  Things,”  http://www.  red  3d.  com/cwr/nobumv/nobumy.  html 


10 

American  Institute  of  Aeronautics  and  Astronautics 


yi 


Figure  4.  Static  Obstacle  Avoidance 


D.  Velocity  Command  Blending 

The  composite  velocity  vector  command  is  given  by  blending  the  outputs  of  the  formation  controller  and  the 
obstacle  avoidance  controller.  The  velocity  command  for  a  follower  vehicle  is 


Vcmdfoll  -  C1  VOAfoll  +  l1  -  C1  )VFCfo„ 


(35) 


The  weight  cl  is  chosen  such  that  obstacle  avoidance  has  higher  priority  than  formation  control.”  The  velocity 
command  for  the  leader  vehicle  is 


Vcmdlead  ~cl VOAlead  +£(i-Ci  )VFClead  +(l“Ci  -k(\-cx))VL 


(36) 


where  VL  is  the  leader  component  of  velocity  command  for  following  a  nominal  trajectory.  The  factor  k  is  set 
equal  to  0.2  implying  that  formation  control  is  the  lowest  priority  for  the  leader. 

E.  Simulation  Results 

We  consider  a  team  of  4  aircraft  flying  in  formation.  Aircraft  1  is  the  team  leader.  It  sets  the  trajectory  for  the 
formation  by  commanding  a  sequence  of  heading  changes  at  specified  time  intervals  y/L  (i)  while  commanding 
constant  speed  VL  .  It  is  desired  that  the  formation  achieve  the  diamond-shape  formation  shown  in  figure  5  in  the 
steady  state.  We  specify  a  set  of  LOS  range  commands  that  are  consistent  with  the  desired  formation  shape. 
Cooperation  between  the  aircraft  is  imposed  by  having  all  aircraft  regulate  LOS  range  from  each  other.  The  aircrafts 
are  referenced  by  the  indices  1,  2,  3,  4.  The  solid  lines  with  arrow  ends  indicate  desired  steady-state  velocity  vectors 
for  the  aircrafts  in  formation. 
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Figure  5.  Desired  Formation  Shape 


We  present  results  for  cases  with  adaptation  (NN  on)  and  without  adaptation  (NN  off).  Hedging  is  on  (H  on) 
only  for  NN  on.  Figures  6  and  7  show  the  trajectory  plot  of  the  formation  with  the  NN  off  and  NN  on  respectively. 
Aircraft  1  starts  at  (0,0),  aircraft  2  at  (6,  -8),  aircraft  3  at  (-8,  -6)  and  aircraft  4  at  (0,-10).  For  the  simulation,  the 
following  values  of  LOS  ranges  between  pairs  of  aircraft  were  commanded: 


R 

R 


12 

com 

14 

com 


=  R, 
=  R 


13 

com 

23 

com 


24 


= 

=  V2 


_  d34  —in 
^ com  LmKJ 


(37) 
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0 
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- Aircraft  1 
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Figure  6.  Formation  Trajectory  (NN  off) 
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Figure  7.  Formation  Trajectory  (NN  on) 

The  desired  formation  shape  is  seen  to  be  achieved  when  there  are  no  obstacles  in  the  path  of  the  aircraft  and 
when  the  leader  vehicle  is  not  commanding  a  heading  change.  The  difference  in  the  Figs  6  and  7  is  the  size  of  the 
box  flown  by  the  formation  with  NN  on.  This  is  because  with  NN  on,  the  formation  flies  at  the  commanded  speed  of 
the  leader  VL  in  steady-state.  With  NN  off,  the  leader  slows  down  for  the  followers  to  catch  up  with  it,  and  the 
formation  flies  at  a  lower  speed  in  the  steady  state.  We  can  thus  infer  that  cooperation  between  the  aircraft  is 
enhanced  with  NN  on.  The  speed  histories  for  the  formation  with  NN  off  and  NN  on  are  shown  in  Figs  8  and  9 
respectively. 
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Figure  8.  Speed  Histories  (NN  off) 


Figure  9.  Speed  Histories  (NN  on) 
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Figures  10  and  1 1  plot  the  error  between  the  commanded  range  and  the  true  range  (. e y  =  R%om  -  Rjj )  between  all 

pairs  of  aircraft.  The  plots  show  steady-state  error  in  commanded  range  with  NN  off  and  zero  steady-state  error 
with  NN  on. 


Figure  12.  Error  in  Commanded  Range  (NN  off) 
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2 


Figure  11.  Error  in  Commanded  Range  (NN  on) 


V.  COORDINATION  SCHEME  FOR  LEADERLESS  FORMATION  FLYING 

The  problem  with  a  leader-follower  formation  control  scheme  is  in  the  concept  of  a  designated  leader.  Such  a 
formation  lacks  robustness  to  a  failure  in  the  leader  vehicle.  Secondly,  it  is  not  practical  to  pre-specify  LOS  ranges 
between  pairs  of  vehicles  for  large  numbers  of  vehicles  in  formation.  Possibilities  of  failure  in  one  or  more  follower 
vehicles  further  complicate  this  problem.  So,  we  propose  a  coordination  scheme  that  does  not  depend  on  a  unique 
leader,  is  robust  to  failures  in  one  or  more  vehicles  and  allows  easy  scaling  of  the  formation. 

We  remove  the  assumption  of  a  designated  leader  for  the  formation.  Each  vehicle  now  commands  a  nominal 
velocity  vector  when  not  tracking  any  neighboring  vehicle.  The  nominal  velocity  involves  heading  towards  a  set  of 
waypoints  at  constant  speed.  The  set  of  waypoints  is  common  to  all  the  vehicles.  The  nominal  velocity  vector  Vnom 
is  given  as 


V  =  V, 

r  norrij  r  L 

Vnom,  =  a  tan  2(. ywP  ~  T/  ’XWP  -  xi ) 


(38) 


where  {xWP,yWP)  represent  inertial  coordinates  of  the  waypoints.  Once  the  vehicle  comes  within  a  specified 
distance  of  one  waypoint,  it  starts  heading  towards  the  next  waypoint.  The  order  in  which  the  waypoints  are  tracked 
is  the  same  for  all  vehicles. 

Each  vehicle  tracks  up  to  two  nearest  vehicles  depending  upon  the  range  to  the  vehicle.  The  algorithm  for 
choosing  the  number  of  vehicles  to  track  is  as  follows. 

Let  R\(t)  and  denote  LOS  ranges  to  two  nearest  vehicles.  Let  Rnrdx  >0be  a  constant  and  NV  the 

number  of  vehicles  tracked. 
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If  *l(*)>*max 
NV  =  0 

Elseif  R2{t)<Rmax 
NV  =  2 

Else 

NV  =  \ 

End 

Figure  12.  Logic  for  Choosing  Number  of  Vehicles  to  Track 

The  formation  control  objective  is  to  regulate  range  from  NV  number  of  nearest  vehicles  to  Rcom  .  The 
value  for  Rcom  is  such  that  0  <  Rcom  <  Rmax  and  is  a  constant  for  all  the  vehicles  in  the  formation. 

When  the  number  of  nearest  neighbors  NV  changes,  the  control  law  switches.  Switching  of  the  control  laws  also 
takes  place  when  a  nearest  neighbor  is  replaced. 

We  design  adaptive  formation  controllers  to  regulate  LOS  range  from  every  vehicle  in  the  formation,  but 
tracking  takes  place  only  with  NV  number  of  neighbors.  The  implication  is  that  not  all  adaptive  controllers  are  in 
control  of  the  plant,  and  switches  can  take  place  between  the  adaptive  controllers  that  are  in  control  of  the  plant. 
PCH  allows  adaptation  to  continue  safely  when  not  in  control  of  the  plant.18 

Since  the  number  of  vehicles  tracked  may  change  in  time,  the  commanded  velocity  vector  also  changes.  Let 

X\  {t)  and  X2{t)  denote  LOS  angles  with  respect  to  the  two  closest  vehicles,  X\(t)  and  X2(t)  the  associated  LOS 
unit  vectors,  and  VFC\  and  Vpc^  the  commanded  velocity  vectors  for  regulating  range  from  the  two  closest 
vehicles.  Then,  the  velocity  vector  command  is  given  as 


If  NV  =  l 


I* cmd  ClVOA  (l  0  nom  ^(l  c\){^nom  •  X-j  ^jX\  )+  k{\  C  |  jf/.Q 


(39) 


If  NV  =  2 


Vcmd  ~c\  VOA  +(l_cl  i^nom  ~  *(l  ~  C1 )  (fnom  +{^nom  *  ^2  )^2  J+  *(l  _  C1  ^ FC2  ] 


(40) 


If  NV  =  0 

I8 cmd  ~  O  [  ()A  "t"  (l  —  C1  Y'  nom 


(41) 


where  (•]  is  the  dot  product  operator,  and  k  >  0  is  a  tuning  parameter  that  indicates  the  relative  priority  for 
formation  control  with  respect  to  nominal  velocity  vector  tracking. 

Eq.  (39)  and  (40)  are  constructed  with  the  objective  that  the  velocity  vectors  should  converge  to  Vnom  when  the 
commanded  range  errors  are  zero,  Vnom  is  the  same  for  all  vehicles,  and  when  there  are  no  obstacles  to  avoid.  This 


can  be  understood  by  noting  that  (V, 


>  X,  )X- 


l 


V  •  X- 

r  nom  z 


is  the  projection  of  Vnom  along  the  unit  vector 
direction  X\  {x2 ) ,  and  VFC  (v )  is  an  estimate  of  the  velocity  of  the  closest  (second  closest)  neighbor  along 
X\  (x2 )  when  the  commanded  range  errors  are  zero.  So,  the  desired  equilibrium  configuration  for  the  formation  is 


not  reached  unless  ( V. 


'  h  Mi 


Vnom  *  ^  Uo 


equals  V FC{  [vFCi )  =>  Vcmd 


=  V„ 


Simulation  Results 
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The  first  set  of  results  with  the  leaderless  formation  control  scheme  is  shown  for  a  group  of  4  aircraft  with 
identical  nominal  velocity  vectors  Vnom  . 

The  nominal  velocity  vector  is  a  sinusoidal  heading  profile  at  constant  speed 


V  =15 

r  nom  A  • 


V  nom 


(39) 


The  value  of  Rcom  chosen  is  0.5 .  The  initial  positions  of  the  aircraft  were  chosen  such  that  each  aircraft  was 
tracking  2  neighboring  aircraft.  Figure  13  shows  the  trajectory  plot  for  the  formation.  Note  that  the  dimensions  along 
the  x-axis  have  been  scaled  up  in  the  plot  below. 


Figure  13.  Leaderless  Formation  Trajectory  for  Identical  Nominal  Velocity 

Figure  13  shows  the  LOS  range  histories  for  all  aircraft  in  formation.  The  plot  shows  convergence  to  the 
commanded  range  Rcom  of  the  LOS  ranges  from  the  2  closest  neighbors  for  all  aircraft.  The  plot  also  shows  that 
aircraft  1  and  3  are  separated  by  a  range  larger  than  Rcom  in  steady-state,  and  that  aircraft  2  and  4  seem  to  be  at  the 
commanded  range  from  all  aircraft.  This  suggests  that  the  formation  has  split  into  2  groups  with  aircrafts  2  and  4 
common  to  both  groups.  The  splitting  of  the  formation  is  a  common  result  with  the  leaderless  formation  control 
scheme. 
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Figure  13.  LOS  Range  Histories  for  Leaderless  Formation 

Figure  14  shows  the  plot  of  some  of  the  inversion  errors  Ay  plotted  against  the  corresponding  NN  outputs 

v ad {j  ;1  •  The  plot  shows  very  good  tracking  of  the  Ay  .  Note  in  particular  the  first  subplot.  The  signal  t /a(/(ii3)  is  the 

NN  output  of  aircraft  1  designed  to  track  aircraft  3.  From  figure  13,  we  know  that  aircraft  1  does  not  track  aircraft  3 
because  aircraft  2  and  4  are  its  closest  neighbors.  This  shows  that  adaptation  continues  despite  the  adaptive  control 
not  being  in  control  of  the  plant. 
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Figure  14.  Inversion  Error  and  NN  outputs 

Next  we  consider  a  group  of  5  aircraft  tracking  a  sequence  of  waypoints  in  the  counter-clockwise  direction. 
Figure  15  shows  the  trajectory  of  the  formation.  Waypoints  are  marked  in  the  plot  by  red  crosses.  The  plot  shows 
that  the  formation  is  achieved  and  maintained  at  places  where  there  are  no  obstacles.  The  formation  is  also  seen  to 
split  to  go  around  an  obstacle  and  later  rejoin. 

Figure  16  shows  the  LOS  ranges  between  all  pairs  of  aircraft  in  the  formation.  It  can  again  be  concluded  that  the 
formation  has  split  into  groups  by  noting  that  only  some  of  the  LOS  ranges  have  converged  to  the  commanded  value 

D 

^ com  • 
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Figure  15.  Leaderless  Formation  Trajectory  with  Waypoint  Tracking 


Figure  16.  LOS  Range  Histories  for  Leaderless  Formations  with  Waypoint  Tracking 


Figure  17  shows  the  number  of  vehicles  being  tracked  by  every  vehicle  during  the  maneuver.  The  number  of 
neighboring  vehicles  tracked  is  seen  to  change  in  time. 
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Figure  17.  Number  of  Neighbor  Vehicles  (AT)  Tracked 

Figure  18  shows  a  maneuver  in  which  the  formation  changes  from  a  wide  formation  to  a  line-shaped  formation. 
Line-shaped  formations  are  desirable  when  the  formation  is  required  to  squeeze  through  narrow  corridors.  The  line¬ 
shaped  formation  is  achieved  when  each  vehicle  tracks  the  nearest  vehicle  that  lies  in  a  conical  region  in  front  of  it. 
When  there  are  no  vehicles  in  this  conical  region,  each  vehicle  has  nominal  motion  directed  towards  a  waypoint. 
This  waypoint  is  common  to  all  vehicles  and  can  be  considered  to  be  a  point  at  the  entrance  of  the  narrow  corridor. 
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Figure  18.  Transition  to  Line-Shaped  Formation 
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VI.  CONCLUSIONS 

We  have  formulated  a  decentralized  adaptive  guidance  strategy  that  enables  safe  and  coordinated  motion  of  a 
group  of  unmanned  vehicles  in  an  environment  with  obstacles.  We  have  shown  that  adaptation  benefits  by 
enhancing  the  cooperation  between  the  vehicles  in  formation. 

We  have  implemented  two  coordination  schemes  for  formation  control:  leader-follower  formation  scheme  and 
leaderless  formation  scheme.  In  the  adaptive  leader-follower  formation  scheme,  we  have  modified  the  PCH  signal 
construction  to  enable  the  tracking  of  two  or  more  LOS  range  variables  with  zero  steady  state  error.  This  is  an 
improvement  over  our  preceding  result.  The  leaderless  formation  control  scheme  is  proposed  as  a  way  of  dealing 
with  the  robustness  issues  of  the  leader-follower  formation  control  scheme.  The  decentralized  formations  that  result 
from  the  application  of  this  scheme  can  perform  maneuvers  like  splitting  /  rejoining  around  obstacles  and  changing 
into  line-shaped  formation  in  order  to  move  through  narrow  corridors. 
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